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Full text available: ^ pdf(4.21 MB) Additional Infomnation: full citation , abstract , references , index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based on process-time diagre 
better understanding of the execution of the application. The visualization tool we use is Poet, an event tracer di 
Waterloo. However, these diagrams are often very complex and do not provide the user with the desired overvie 
experience, such tools display repeated occurrences of non-trivial comnnun ... 
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This paper presents methods for automatically creating pictorial video summaries that resemble comic books. Th 
segments is computed from their length and novelty. Image and audio analysis is used to automatically detect a 
Based on this importance measure, we choose relevant keyframes. Selected keyframes are sized by Importance, 
pictorial summary. We present a quantitative measure of how well a su ... 
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In order to improve the acceptance of recorded presentations, we introduce a new open document type covering 
classes typically appearing in this scenario. Instances of this document type can be replayed using our time-base 
Random access In combination with the realized stream/media-layered synchronization mechanism results In es! 
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W. Zhao, R. Chellappa, P. J. Phillips, A. Rosenfeld 

December 2003 ACM Computing Surveys (CSUR), volume 35 issue 4 

Full text available: ^ pdf(4.28 IVIB) Additional Infonnation: full citation , abstract , references , index terms 

As one of the most successful applications of image analysis and understanding, face recognition has recently re 
especially during the past several years. At least two reasons account for this trend: the first is the wide range o 
enforcement applications, and the second is the availability of feasible technologies after 30 years of research. E 
recognition systems have reached a certain level of maturity, their success is ... 
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Barry Arons 
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Full text available: ^ pdf(1.03 MB) Additional Information: full citation, abstract , references , citings , index terms . 

Listening to a speech recording is much more difficult than visually scanning a document because of the transier 
Audio recordings capture the richness of speech, yet it is difficult to directly browse the stored information. This 
structuring, filtering, and presenting recorded speech, allowing a user to navigate and interactively find informat 
article describes the SpeechSkimmer system for Interacti ... 

Keywords: audio browsing. Interactive listening, nonspeech audio, speech as data, speech skimming, speech ui 
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Full text available: IPl pdf(828.46 KB) Additional Information: full citation , abstract , references , dtiogs, index terms 



Data sets in large applications are often too massive to fit completely inside the computers internal memory. Thi 
communication (or I/O) between fast internal memory and slower external memory (such as disks) can be a ma 
this article we survey the state of the art in the design and analysis of external memory (or EM) algorithms and 
to exploit locality in order to reduce the I/O costs. We consider a varie ... 

Keywords: B-tree, I/O, batched, block, disk, dynamic, extendible hashing, external memory, hierarchical mem( 
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Full text available: ^ pdf(410,37 KB) Additional Information: full citation , abstract, references , index terms 

The authors developed a system In which visually dense displays of thumbnail imagery in storyboard views are l 
retrieval. The views allow for effective retrieval, as evidenced by the success achieved by expert users with the 5 
NISTTRECVID 2002 and 2003. This paper demonstrates that novice users also achieve comparatively high retri( 
using the TRECVID 2003 benchmarks. Through an analysis of the user interact ... 

Keywords: TRECVID, storyboard, video retrieval 



10 Content-based retrieval: VideoQA: question answering on news video 
Hui Yang, Lekha Chaisorn, Yunlong Zhao, Shi-Yong Neo, Tat-Seng Chua 

November 2003 Proceedings of the eleventh ACM international conference on Multimedia 

Full text available: ^ pdf(592.26 KB) Additional Information: full citation , abstract, references , index temns 

When querying a news video archive, the users are interested in retrieving precise answers in the form of a sum 
query. However, current video retrieval systems, including the search engines on the web, are designed to retrie 
answers. This research explores the use of question answering (QA) techniques to support personalized news vie 
our system, VideoQA, using short natural language questions with implicit ... 

Keywords: transcript error correction, video question answering, video retrieval, video summarization 



11 Auto-summarization of audio-video presentations 
Liwei He, Elizabeth Sanocki, Anoop Gupta, Jonathan Grudin 

.October 1999 Proceedings of the seventh ACM international conference on Multimedia (Part 1) 

Full text available:^ pdf(1.55 MB) Additional Information: full citation , abstract, references , citings , index terms 

As streaming audio-video technology becomes widespread, there is a dramatic increase in the amount of multim 
Users face a new challenge: How to examine large amounts of multimedia content quickly. One technique that c 
multimedia Is video summaries; that is, a shorter version assembled by picking important segments from the ori 
techniques for automatic creation of summaries for online audio-video ... 

Keywords: corporate training, digital library, streaming media, user evaluation, user log analysis, video on-den 



12 Spoken dialogue technology: enabling the conversational user interface 
Michael F. McTear 

March 2002 ACM Computing Surveys (CSUR), volume 34 issue i 

Full text available: ^ pdf(987.69 KB) Additional Information: full citation , abstract , references , citings , index terms . 

Spoken dialogue systems allow users to interact with computer-based applications such as databases and expert 
language. The origins of spoken dialogue systems can be traced back to Artificial Intelligence research in the 191 
conversational interfaces. However, it is only within the last decade or so, with major advances in speech technc 
systems have been developed and, in some cases, introduced into commerc ... 

Keywords: Dialogue management, human computer interaction, language generation, language understanding, 
synthesis 
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Yu-Fei Ma, Lie Lu, Hong-Jiang Zhang, Mingjing Li 

December 2002 Proceedings of the tenth ACM international conference on Multimedia 

Full text available: ' ^pdf(644.28 KB) Additional Information: full citation , abstract , references , citings 

Automatic generation of video summarization is one of the key techniques in video management and browsing. ] 
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generic framework of video summarization based on tlie modeling of viewer's attention. Without fully semantic l 
this framework takes advantage of understanding of video content, this framework takes advantage of computal 
eliminates the needs of complex heuristic rules in video summarization. A set of methods ... 

Keywords: attention model, skimming, video content analysis, video summarization 
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Bin Yu, Wei-Ying l^a, Klara Nahrstedt, Hong-Jiang Zhang 

November 2003 Proceedings of the eleventh ACM international conference on Multimedia 

Full text available: ^ pdf(771 .50 KB) Additional Information: full citation , abstract, references, index terms 

Efficient video data management calls for intelligent video summarization tools that automatically generate cone 
sl<imming and browsing. Traditional video summarization techniques are based on low-level feature analysis, wh 
semantics of video content. Our vision is that users unintentionally embed their understanding of the video contt 
computers. This valuable knowledge, which is difficult for computers to I ... 

Keywords: link analysis, log mining, skimming, user behavior, video content analysis, video summarization 
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Harl Sundaram, Lexing Xie, Shih-Fu Chang 

December 2002 Proceedings of the tenth ACM international conference on Multimedia 

Full text available: ^ pdf(487.92 KB) Additional Information: full citation , abstract, references , citings 

In this paper, we present a novel algorithm for generating audio-visual skims from computable scenes. Skims ar 
libraries, and for on-demand summaries in set-top boxes. A computable scene is a chunk of data that exhibits cc 
chromaticity, lighting and sound. There are three key aspects to our approach: (a) visual complexity and gramm 
and (c) an utility model for skim generation. We define a measure of visual c ... 

^7 Evolving video skims into useful multimedia abstractions 

Michael G. Christel, Michael A. Smith, C. Roy Taylor, David B. Winkler 

January 1998 Proceedings of the SIGCHI conference on Human factors in computing systems 

Full text available: ^ pdf(1.02 MB) Additional Information: full citation , references , citings, index terms 



Keywords: digital video library, empirical studies, evaluation, multimedia, video abstraction, video browsing, vi 



Data clustering: a review 

A. K. Jain, M. N. Murty, P. J. Flynn 

September 1999 ACM Computing Surveys (CSUR), volume 3i issue 3 

Full text available: ^ pdf(636.24 KB) Additional Information: full citation , abstract, references , citings , index terms. 
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Clustering is the unsupervised classification of patterns (observations, data items, or feature vectors) into group 
problem has been addressed in many contexts and by researchers in many disciplines; this reflects its broad apf 
steps in exploratory data analysis. However, clustering is a difficult problem combinatorially, and differences in a 
different communities has made the transfer of useful generic co ... 

Keywords: cluster analysis, clustering applications, exploratory data analysis, incremental clustering, similarity 



A confederation of tools for capturing and accessing collaborative activity 

Scott l^inneman, Steve Harrison, Bill Janssen, Gordon Kurtenbach, Thomas Moran, Ian Smith, Bill van Melle 
January 1995 Proceedings of the third ACM international conference on Multimedia 

Full text available: htm(73.96 KB) Additional Information: full citation , references , citings , index terms 



Keywords: CSCW, activity capture, content-and content-based indexing and retrieval, digital audio and video, c 
real-time indexing, usability, user interfaces 

20 Long papers: multinnodal interaction: Multimodal new vocabulary recognition through speech and handwril 

ap plication 
Edward C. Kaiser 

January 2005 Proceedings of the 10th international conference on Intelligent user interfaces 

Full text available: ^ pdf(428.63 KB) Additional Infonmation: full citation, abstract , references , index terms 

Our goal Is to automatically recognize and enroll new vocabulary in a multimodal interface. To accomplish this oi 
mutually disambiguating aspects of co-referenced, co-temporal handwriting and speech. The co-referenced semi 
determined by our multimodal interface for schedule chart creation. This paper motivates and describes our tech 
vocabulary (GOV) terms and enrolling them dynamically in the system. We ... 

Keywords: multimodal interaction, mutual disambiguation, vocabulary learning 
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22 The state of the art in automating usability evaluation of user interfaces 
Melody Y. Ivory, Marti A Hearst 

December 2001 ACM Computing Surveys (CSUR), volume 33 issue 4 

Full text available: ^ p df (2.31 MB) Additional Infomriation: full citation , abstract , references , citings, index terms. 

Usability evaluation is an increasingly important part of the user interface design process. However, usability ev< 
of time and human resources, and automation is therefore a promising way to augment existing approaches. Th 
survey of usability evaluation methods, organized according to a new taxonomy that emphasizes the role of autc 
existing techniques, identifies which aspects of usability evaluation aut ... 

Keywords: Graphical user Interfaces, taxonomy, usability evaluation automation, web interfaces 



23 Three-dimensional ob j ect recognition 
Paul J. BesI, Ramesh C. Jain 

March 1985 ACM Computing Surveys (CSUR), volume i? issue 1 

Full text available: ^ pdf(7.76 MB) Additional Information: full citation , abstract , references , citings, index terms . 

A general-purpose computer vision system must be capable of recognizing three-dimensional (3-D) objects. This 
definition of the 3-D object recognition problem, discusses basic concepts associated with this problem, and revl< 
Because range images (or depth maps) are often used as sensor input instead of intensity images, techniques fc 
characterizing range data are also surveyed. 

24 Advances in domain independent linear text segmentation 
Freddy Y. Y. Choi 

April 2000 Proceedings of tlie first conference on North American cliapter of tlie Association for Con 

Full text available: ^ pdf(828,85 KB) Additional Information: full citation, abstract , references, citings 

This paper describes a method for linear text segmentation which is twice as accurate and over seven times as f 
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(Reynar, 1998). Inter-sentence similarity is replaced by rank in the local context. Boundary locations are discovc 

25 Query evaluation techniques for large databases 
Goetz Graefe 

June 1993 ACM Computing Surveys (CSUR), volume 25 issue 2 

Full text available: pdf(9.37 MB) Additional Information: full citation , abstract, references, citings , index tenns . 

Database management systems will continue to manage large data volumes. Thus, efficient algorithms for acces 
and sequences will be required to provide acceptable performance. The advent of object-oriented and extensible 
this problem. On the contrary, modern data models exacerbate the problem: In order to manipulate large sets o 
today's database systems manipulate simple records, query-processi ... 

Keywords: complex query evaluation plans, dynamic query evaluation plans, extensible database systems, iter; 
systems, operator model of parallellzation, parallel algorithms, relational database systems, set-matching algorit 



26 Charting past present, and future research in ubiquitous computing 
Gregory D. Abowd, Elizabeth D. Mynatt 

March 2000 ACM Transactions on Computer-Human Interaction (TOCHI), volume 7 issue i 

Full text available: ^ pclf(730.83 KB) Additional Information: full citation , abstract , references , citings , index terms 

The proliferation of computing into the physical world promises more than the ubiquitous availability of computir 
paradigms of interaction inspired by constant access to information and computational capabilities. For the past 
research on abiquitous computing (ubicomp) has pushed three interaction themes: natural interfaces, context-av 
capture and access. To chart a cours ... 

Keywords: augmented reality, capture and access, context-aware applications, evaluation, everyday computinc 
implications, ubiquitous computing, user interfaces 



27 Video Retrieval and Browsing: Learning video browsing behavior and its application in the generation of vi 
Tanveer Syeda-Mahmood, Dulce Ponceleon 

October 2001 Proceedings of the ninth ACM international conference on Multimedia 

Full text available: ^pdf(1.86 MB) Additional Infomiation: full citation , abstract, references , citing s, index terms 

With more and more streaming media servers becoming commonplace, streaming video has now become a popi 
advertisement, and entertainment. With such prevalence comes a new challenge to the servers: Can they track 
determine what interest users? Learning this information is potentially valuable not only for improved customer I 
commerce, but also in the generation of fast previews of videos for easy pre-downloads. ... 

Keywords: audio, browsing behavior, interesting content, learning, topics, video previews 



28 Autonnatically extracting hi ghli ghts for TV Baseball programs 
Yong Rui, Anoop Gupta, Alex Acero 

October 2000 Proceedings of the eighth ACM international conference on Multimedia 

Full text available: ^ pdf(1.08 MB) Additional Information: full citation , abstract , references , citings , index terms 

In today's fast-paced world, while the number of channels of television programming available is increasing rapi« 
them remains the same or is decreasing. Users desire the capability to watch the programs time-shifted (on-den 
highlights to save time. In this paper we explore how to provide for the latter capability, that is the ability to ext 
that viewing time can be reduced. 

We focus on the sp ... 

Keywords: audio, baseball, highlights, summarization, television, video 
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29 Workshop reports: Workshop report: the first ACM international workshop on multimedia databases (MMC 
Shu-Ching Chen, Mei-Ling Shyu 
July 2004 ACM SIGIR Forum, volume 38 issue i 

Full text available: gpdf(1 17.52 KB) Additional Infonmatlon: full citation 



30 Computational strategies for object recognition 
Paul Suetens, Pascal Fua, Andrew J. Hanson 

March 1992 ACM Computing Surveys (CSUR), volume 24 issue i 

Full text available: pdf(6.37 MB) Additional Information: full citation , abstract , references , citings , index terms . 

This article reviews the available methods for automated identification of objects in digital images. The technique 
according to the nature of the computational strategy used. Four classes are proposed: (1) the simplest strategic 
appropriate for feature vector classification, (2) methods that match models to symbolic data structures for situ< 
complex models, (3) approaches that fit models to the photometry and ... 

Keywords: image understanding, model-based vision, object recognition 



31 Groupware: some issues and experiences 
Clarence A. Ellis, Simon J. Gibbs, Gail Rein 
January 1991 Communications of the ACM, volume 34 issue i 

Full text available: Q pdf(7.22 MB) Additional Information: full citation , references , citings , index terms 



32 Ca pturin g , structuring, and representing ubiquitous audio 
Debby Hindus, Chris Schmandt, Chris Horner 

October 1993 ACM Transactions on Information Systems (TOIS), volume ii issue 4 

Full text available: ^pdf(1.78 MB ) Additional Information: full citation , abstract , references , citings , index terms 

Although talking is an integral part of collaboration, there has been little computer support for acquiring and aco 
conversations. Our approach has focused on ubiquitous audio, or the unobtrusive capture of speech interactions 
Speech recognition technology cannot yet transcribe fluent conversational speech, so the words themselves are 
captured interactions. Instead, the structure of an int ... 

Keywords: audio interactions, collaborative work, multimedia workstation software, semi -structured data, softv 
ubiquitous computing 
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Marti A. Hearst 

March 1997 Computational Linguistics, volume 23 issue i 

Full text available: ^ p(jf(2.46 MB)I ^ Publisher Site Additional Information: full citation , abstract , references, citings 

TextTiling is a technique for subdividing texts into multi-paragraph units that represent passages, or subtopics. ' 
major subtopic shifts are patterns of lexical co-occurrence and distribution. The algorithm is fully implemented a 
segmentation that corresponds well to human judgments of the subtopic boundaries of 12 texts. Multi-paragrapf 
useful for many text analysis tasks, including information retrieval and ... 
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AtuI Puri, Alexandres Eleftheriadis 

June 1998 Mobile Networks and Applications, volume 3 issue i 

Full text available: ^ pdf(747.80 KB) Additional Information: full citation , abstract , references , citings , index terms . 

The ISO MPEG committee, after successful completion of the MPEG-1 and the MPEG-2 standards is currently wor 
standard. Originally, MPEG-4 was conceived to be a standard for coding of limited complexity audio-visual scene 
in July 1994, its scope was expanded to include coding of scenes as a collection of individual audio-visual object 
advanced functionalities not supported by other standards. One of the ke ... 
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October 2000 Proceedings of tlie eighth ACM international conference on Multimedia 

Full text available: ^ pdf(1.04 MB) Additional Information: full citation , abstract , references, citinos . index temris 

The detection of events is essential to high-level semantic querying of video databases. It is also a very challeng 
detection and integration of evidence for an event available in multiple information modalities, such as audio, vie 
focuses on the detection of specific types of events, namely, topic of discussion events that occur in classroom/U 
we present a query-driven approach to the detection of topic of ... 
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We give a comprehensive report on our experiments with retrieval from OCR-generated text using systems base 
More specifically, we show that average precision and recall Is not affected by OCR errors across systems for sev 
used in these experiments include both actual OCR-generated text and standard information retrieval collections 
of OCR errors. Both the actual and simulation experiments inc ... 
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Full text available: ^ pdf(294.13 KB) Additional Information: full citation , abstract , references , index terms 

With the explosive growth of the World Wide Web, the public is gaining access to massive amounts of informatio 
relevant information remains a difficult task, whether the information Is textual or visual. Text search engines he 
and have achieved a certain degree of success. However, despite the large number of images available on the W 
rare. In this article, we show that in order to allow people to profi ... 
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This work gives an overview of a new technology that is attracting growing Interest In public as well as In the coi 
difference from other technologies Is in the use of a pen or pencil as the primary means of Interaction between a 
the familiar pen and paper Interface metaphor. From this follows a set of consequences that will be analyzed anc 
emerging technologies and visions. Starting with a short historic ... 

^2 An interactive comic book presentation for exploring video 

John Boreczky, Andreas Girgensohn, Gene Golovchinsky, Shingo Uchihashi 

April 2000 Proceedings of the SIGCHI conference on Human factors in computing systems 

Full text available: ^ pdf(1.62 MB) Additional Infonmation: full citation , abstract , references , citings , index terms 

This paper presents a method for generating compact pictorial summarizations of video. We developed a novel a 
from a video suitable for summarizing the video and for providing entry points into it. Images are laid out in a cc 
reminiscent of a comic book or Japanese manga. Users can explore the video by interacting with the presented £ 
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Full text available: ^ pdf(540.75 KB) Additional Information: full citation , abstract , references , citings , index terms 

Increasing amounts of public, corporate, and private speech data are now available on-line. These are limited in 
lack of tools to permit their browsing and search. The goal of our research is to provide tools to overcome the in 
access, by supporting visual scanning, search, and information extraction. We describe a novel principle for the < 
What You See Is Almost What You Hear (WYSIAWYH). In WYSI ... 
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Full text available: ^pdf(618.98 KB) Additional Infonmation: full citation , abstract, references , index tenns 

Past work on multimedia analysis has shown the utility of detecting specific temporal patterns for different conte 
propose a unified, content-adaptive, unsupervised mining framework to bring out such temporal patterns from d 
formulate the problem of pattern discovery from video as a time series clustering problem. We treat the sequenc 
features extracted from the video as a time series and perform a tempor ... 

Keywords: audio classification, time series analysis, video summarization 



47 Inte g rated technolo gi es for indexing spoken langua ge 

Francis Kubala, Sean Colbath, Daben Liu, Amit Srivastava, John Makhoul 
February 2000 Communications of the ACM, volume 43 issue 2 
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Full text available: ^pdf(4 14.60 KB) Additional Information: full citation , abstract , references , index terms 

Combining retrieval results from multiple modalities plays a crucial role for video retrieval systems, especially foi 
systems without any user feedback and query expansion. However, most of current systems only utilize query ir 
explicit user weighting. In this work, we propose using query-class dependent weights within a hierarchial mixtu 
combine multiple retrieval results. We first classify each user query int ... 

Keywords: learning, modality fusion, query class, video retrieval 
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Full text available: ^ pdf(1.15 MB) Additional Information: full citation , abstract , references . Index terms 

The design of robust interfaces that process conversational speech is a challenging research direction largely bee 
variable. This research explored a new dimension of speaker stylistic variation by examining whether users' spec 
the text-to-speech (TTS) heard from a software partner. To pursue this question, a study was conducted in whic 
children conversed with animated partners that embodied different ... 

Keywords: Adaptive interfaces, amplitude, animated characters, children's educational software, communicatio 
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Full text available: ^ pdf(328.98 KB) Additional Information: full citation , abstract , references , citings, index terms 

Queries to text collections are resolved by ranking the documents in the collection and returning the highest-sco 
alternative retrieval method is to rank passages, that is, short fragments of documents, a strategy that can impr 
relevant material in documents that are too large for users to consider as a whole. However, ranking of passage: 
retrieval costs. In this article we explore alternative query evalua ... 

Keywords: inverted files, passage retrieval, query evaluation, text databases, text retrieval 
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Jianhua Tao, Tienlu Tan 

October 2004 Proceedings of the 6th international conference on Multimodal interfaces 

Full text available:^ pdf(327J9 KB) Additional Information: full citation , abstract , references , index terms 

Natural Human-Computer Interface requires integration of realistic audio and visual information for perception a 
talking head system is proposed. The system converts text to speech with synchronized animation of mouth mo^ 
The talking head is based on a generic 3D human head model. The personalized model is incorporated Into the s 
personalized model offers a more natural and realistic look than t ... 

Keywords: emotion, facial animation, speech synthesis, talking head 
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For an animated human face model to appear natural it should produce eye movements consistent with human ( 
face conversational interactions, eyes exhibit conversational turn-taking and agent thought processes through gi 
patterns. We have implemented an eye movement model based on empirical models of saccades and statistical 1 
animations using stationary eyes, eyes with random saccades only, and eyes wit ... 
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Modern window-based user interface systems generate user interface events as natural products of their normal 
can be automatically captured and because they indicate user behavior with respect to an application's user inte 
regarded as a potentially fruitful source of information regarding application usage and usability. However, becai 
typically voluminos and rich In detail, automated support is generally ... 
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This study reports the results of using minimum description length (MDL) analysis to model unsupervised learnir 
segmentation of European languages, using corpora ranging in size from 5,000 words to 500,000 words. We de\ 
rapidly develop a probabilistic morphological grammar, and use MDL as our primary tool to determine whether tl 
heuristics will be adopted or not. The resulting grammar matches well the analysis that ... 

55 Video I: Kev-frame extraction algorithm using entropy difference 
Marl<os Mentzelopoulos, Alexandra Psarrou 

October 2004 Proceedings of the 6th ACM SZGMM international worlcshop on Multimedia information re 

Full text available: ^ pdf(435.21 KB) Additional Information: full citation , abstract , references , index terms 

The fast evolution of the digital video technology has opened new areas of research. The most Important aspect 
perform video cataloguing, indexing and retrieval. The basic step is to find a way for video abstraction, as this w 
large set of video data with sufficient content representation. In this paper we present an overview of the curren 
algorithms. We propose the Entropy- Difference, an algorithm that perf ... 

Keywords: entropy semantics 



56 A methodology and algorithms for the design of hard real-time multitasking ASICs 
Miodrag Potkonjak, Wayne Wolf 

October 1999 ACM Transactions on Design Automation of Eiectronic Systems (TODAES), volume 4 issue 4 
Full text available: ^pdf( 198.48 KB) Additional Information: full citation , abstract , references , index terms , review 

Traditional high-level synthesis concentrates on the implementation of a single task (e.g. filter, linear controller, 
applications— multifunctional embedded controllers intelligent wireless end-points, and DSP and multimedia serv 
computational tasks. This paper describes new techniques for the synthesis of ASIC implementations that realize 
under hard real-time constraints. Our synthes ... 

57 Visual digests for news video libraries 
Michael G. Christel 

October 1999 Proceedings of the seventh ACM international conference on Multimedia (Part 1) 

Full text available: ^ pdf(1.52 MB) Additional Information: full citation , abstract , references , citings, index terms 

The Informedia Digital Video Library contains over 2000 hours of video, growing at a rate of 15 hours per week, 
sufficient for information retrieval because often the candidate result sets grow in number as the library grows. > 
stories from the library, providing users with a visual mechanism for interactive browsing and query refinement, 
dynamically under the direction of the user based on automatically de ... 
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Chinese input presents unique challenges to the field of human computer interaction. This study provides an anc 
standard Chinese input process, which is based on pinyin, a phonetic spelling system in Roman characters. Throi 
performance modeling and experimentation, our study decomposed the Chinese input process Into sub-tasks an» 
and numeric keying, two component resulted from the large number of homophones ... 
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Within the framework of the project NICE (Natural Interactive Communication for Edutainment) [2], we have be 
entertaining computer game that allows children and teenagers to interact with a conversational character inipei 
Andersen (HCA). The rationale behind our system is to make kids learn about HCA's life, fairy tales and historica 
fun. We report on the character's generation and realization of b ... 

Keywords: edutainment, embodied conversational agent, multimodal output, user Interface 
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